Viewing Adaptive Migration Policies for Tiered Storage Systems as a Supervised Learning Problem

نویسندگان

  • Damian Eads
  • Karen Glocer
چکیده

Many file migration algorithms rely on simple, unchanging, automated heuristics to make file placement decisions for exclusively hierarchical storage systems. Such approaches cannot adapt to changes in the workload or data center configuration. Systems with manually-tuned policies offer a way to deal with changes but require well-trained administrators, thereby driving up the cost of storage management. We are developing an automated and adaptive file migration system for arbitrarily large, tiered storage systems with general topologies. We propose to use techniques developed in the field of machine learning to develop an adaptive migration policy. We frame this policy as a supervised learning problem where the labeled data is derived from file system traces. We use the information gathered from file system traces to solve a regression problem in which we try to predict user workload. Migration decisions are made based on this predicted workload. We do this by defining a cost function that considers the performance characteristics of each part of the tiered storage system. We discuss the implementation of some initial parts of the system, in particular the trace processor and replaying simulator. We then present some preliminary experiments conducted with these parts.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Proactive and Adaptive Data Migration in Hierarchical Storage Systems using Reinforcement Learning Agent

With the data generation rates growing exponentially, businesses are having a difficult time maintaining data center infrastructure. Hierarchical storage systems has evolved as a better alternate to managing data, as frequently accessed data is placed on higher tiers and the least frequently accessed data on lower tiers. But the data arrangement is not always static. Data Migration is an operat...

متن کامل

INTEGRATED ADAPTIVE FUZZY CLUSTERING (IAFC) NEURAL NETWORKS USING FUZZY LEARNING RULES

The proposed IAFC neural networks have both stability and plasticity because theyuse a control structure similar to that of the ART-1(Adaptive Resonance Theory) neural network.The unsupervised IAFC neural network is the unsupervised neural network which uses the fuzzyleaky learning rule. This fuzzy leaky learning rule controls the updating amounts by fuzzymembership values. The supervised IAFC ...

متن کامل

FDTM: Block Level Data Migration Policy in Tiered Storage System

ILM and tiered storage system are designed to address the challenge of achieving balance between cost and storage performance. However, both of them are hard to implement fully automatic data migration by traditional solutions for which are mainly relying on administrators experience and need huge manual work for data migration according to storage configuration and IO access patterns. This pap...

متن کامل

Dynamic Tuning of Online Data Migration Policies in Hierarchical Storage Systems using Reinforcement Learning

Multi-tier storage systems are becoming more and more widespread in the industry. In order to minimize the request response time in such systems, the most frequently accessed (“hot”) files should be located in the fastest storage tiers (which are usually smaller and more expensive than the other tiers). Unfortunately, it is impossible to know ahead of time which files are going to be “hot”, esp...

متن کامل

Economic Load Dispatch using PSO Algorithm Based on Adaptive Learning Strategy Considering Valve point Effect

Abstract: In recent years due to problems such as population growth and as a result increase in demand for electrical energy, power systems have been faced with new challenges that not existed in the past. One of the most important issues in modern power systems is economic load dispatch, which is a complex optimization problem with a large number of variables and constraints. Due to the comple...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006